FIG. 2 (flowchart)
200: GENERATE DISCOURSE TREE
202: RECEIVE INPUT
204: BREAK TEXT INTO EDUS
206: PUT EDUS INTO INPUT LISTS
208: PERFORM SHIFT-REDUCE ALGORITHM BASED ON DECISION RULES
210: DISCOURSE TREE FOR INPUT TEXT
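The shift-reduce step of FIG. 2 can be illustrated with a minimal sketch. Everything named here is hypothetical: `shift_reduce`, the `decide` callback, and `toy_policy` stand in for the decision rules, which the figure does not spell out, and the tuples built by "reduce" carry no rhetorical relation or nuclearity labels.

```python
from typing import Callable, List, Optional


def shift_reduce(edus: List[str], decide: Callable[[list, list], str]) -> Optional[object]:
    """Build a binary tree over EDUs with a stack and an input list.

    `decide` is a hypothetical stand-in for the decision rules: it looks at
    the current stack and input list and returns "shift" or "reduce".
    """
    stack: list = []
    queue: list = list(edus)  # the input list of EDUs, left to right
    while queue or len(stack) > 1:
        action = decide(stack, queue)
        if action == "shift" and queue:
            # Move the next EDU from the input list onto the stack.
            stack.append(queue.pop(0))
        elif action == "reduce" and len(stack) >= 2:
            # Join the two topmost trees into one larger tree.
            right = stack.pop()
            left = stack.pop()
            stack.append((left, right))
        else:
            raise ValueError(f"action {action!r} is not applicable")
    return stack[0] if stack else None


def toy_policy(stack: list, queue: list) -> str:
    """Illustrative policy only: reduce whenever two trees are available."""
    return "reduce" if len(stack) >= 2 else "shift"


tree = shift_reduce(["e1", "e2", "e3"], toy_policy)
print(tree)
```

With the toy policy the three EDUs are combined left to right; a learned rule set would replace `toy_policy` and would also choose which relation to assign at each reduce.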



FIG. 3 (block diagram)
INPUT TEXT -> DISCOURSE SEGMENTER (with DECISION RULES) -> EDUS -> DISCOURSE PARSER (with DECISION RULES) -> DISCOURSE TREE
Reference numerals appearing in the figure: 300, 301, 302, 304, 305



FIG. 5 (tree diagrams)
Panel labels: RELATION-NS, RELATION-BELOW-NS, RELATION-BELOW-SN, RELATION-NN, RELATION-BELOW-NN
Node labels: UNDEFINED, NUCLEUS, SATELLITE, SPAN, RELATION, P1, P2




FIG. 6 (flowchart)
600: GENERATE DECISION RULES FOR DISCOURSE SEGMENTER
BUILD TRAINING CORPUS
ASSOCIATE FEATURES WITH EACH LEXEME
(LEARNING CASES)
APPLY LEARNING ALGORITHM
(DISCOURSE SEGMENTER DECISION RULES)

Rule 1: if pos(-1) = ")" ∧ previous-marker = "("
        then end-paren

Rule 2: if pos(-1) = "," ∧ pos(0) = CD ∧ pos(+1) = NNS
        then start-paren

Rule 3: if pos(-1) ≠ DOT ∧ pos(0) = DOT ∧
           pos(+1) ≠ DOT ∧ pos(+1) ≠ DOUBLEQUOTE
        then end-sentence

Rule 4: if pos(+2) = VBD ∧ word(+1) = "and"
        then edu-break

Rule 5: if word(+1) = "until" ∧ isThereAnyVerbBeforeNextPotentialBreak
        then edu-break

Rule 6: if pos(0) = "." ∧ previous-marker = "while"
        then edu-break

Rule 7: if pos(1) = DOT
        then nothing
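
How a rule of this kind might be evaluated at one lexeme can be sketched as follows. This is an illustration under stated assumptions, not the segmenter itself: tokens are taken to be (word, POS tag) pairs, offsets are relative to the lexeme under consideration, the separator between conditions is read as a logical AND, and the helper names `pos`, `word`, `rule_4`, and `classify` are hypothetical. Only Rule 4 is implemented.

```python
from typing import List, Optional, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag); an assumed representation


def pos(tokens: List[Token], i: int, offset: int) -> Optional[str]:
    """POS tag of the token at position i + offset, or None if out of range."""
    j = i + offset
    return tokens[j][1] if 0 <= j < len(tokens) else None


def word(tokens: List[Token], i: int, offset: int) -> Optional[str]:
    """Lower-cased word at position i + offset, or None if out of range."""
    j = i + offset
    return tokens[j][0].lower() if 0 <= j < len(tokens) else None


def rule_4(tokens: List[Token], i: int) -> bool:
    """Rule 4: pos(+2) = VBD and word(+1) = "and"  ->  edu-break."""
    return pos(tokens, i, 2) == "VBD" and word(tokens, i, 1) == "and"


def classify(tokens: List[Token], i: int) -> str:
    """Return the action for lexeme i; falls back to "nothing"."""
    if rule_4(tokens, i):
        return "edu-break"
    return "nothing"


tokens = [("He", "PRP"), ("left", "VBD"), ("and", "CC"),
          ("came", "VBD"), ("back", "RB")]
actions = [classify(tokens, i) for i in range(len(tokens))]
print(actions)
```

In this toy sentence the rule fires at "left", the lexeme immediately before "and came", and every other lexeme is labeled "nothing".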

[Some of the executives who attended yesterday's session weren't a surprise.
Tenneco Inc. Chairman Michael Walsh, for instance, is a staunch Democrat
who provided an early endorsement for Mr. Clinton during the presidential
campaign. Xerox Corp.'s Chairman Paul Allaire was one of the few top
corporate chief executive officers who contributed money to the Clinton
campaign. And others, such as Atlantic Richfield Co. Chairman Lodwrick M.
Cook and Zenith Electronics Corp. Chairman Jerry Pearlman, have also
previously voiced their approval of Mr. Clinton's economic strategy. 1] [But
some faces were fresh. Norman Augustine, the chairman of defense
contractor Martin Marietta Corp., is a registered Republican who has never
stood behind Mr. Clinton. It was also the first formal show of support by
Rand Araskog, the chairman of ITT Corp. 2]

The documentation is typical of Epson quality: excellent.
Documentation is excellent.

All of our design goals were achieved and the delivered performance
matches the speed of the underlying device.
All design goals were achieved.

Reach's E-mail product, MailMan, is a message-management system
designed initially for VINES LANs that will eventually be operating
system-independent.
MailMan will eventually be operating system-independent.

Although the modules themselves may be physically and/or electrically
incompatible, the cable-specific jacks on them provide industry-standard
connections.
Cable-specific jacks provide industry-standard connections.

Ingress/Star prices start at $2,100.
Ingress/Star prices start at $2,100.

Rule 1: IF previous operation was not "Reduce" AND
        previous operation was not "Shift" AND
        previous operation was not "AssignType" AND
        the input list starts with a syntactic constituent of type WHPP
        THEN drop from the input list the words subsumed by WHPP

Rule 2: IF there is only one tree in the stack AND
        previous operation was "Reduce" AND
        the syntactic label of the tree in the stack is NP-A AND
        the input list starts with a syntactic constituent of type WHNP
        THEN drop from the input list the words subsumed by WHNP

Rule 3: IF previous operation was "Drop" AND
        the input list starts with a syntactic constituent of type ADJP AND
        the input list does not start with a syntactic constituent of type NP
        THEN drop from the input list the words subsumed by ADJP
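
The conditions of Rules 1 and 2 can be checked against a small parser state, as in the sketch below. The data model is an assumption made for illustration: the input list is simplified to a flat sequence of labeled constituents, and `Constituent`, `State`, `starts_with`, and `apply_drop` are hypothetical names. Rule 3 is omitted because it needs constituents that can start at the same word, which this flat model does not represent.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Constituent:
    label: str        # syntactic label, e.g. "WHNP" or "NP-A"
    words: List[str]  # the words subsumed by the constituent


@dataclass
class State:
    stack: List[Constituent] = field(default_factory=list)
    input_list: List[Constituent] = field(default_factory=list)
    previous_op: Optional[str] = None


def starts_with(state: State, label: str) -> bool:
    """True if the input list begins with a constituent of the given type."""
    return bool(state.input_list) and state.input_list[0].label == label


def rule_1(state: State) -> bool:
    return (state.previous_op not in ("Reduce", "Shift", "AssignType")
            and starts_with(state, "WHPP"))


def rule_2(state: State) -> bool:
    return (len(state.stack) == 1
            and state.previous_op == "Reduce"
            and state.stack[0].label == "NP-A"
            and starts_with(state, "WHNP"))


def apply_drop(state: State) -> List[str]:
    """Drop the leading constituent if Rule 1 or Rule 2 fires; return its words."""
    if rule_1(state) or rule_2(state):
        dropped = state.input_list.pop(0)
        state.previous_op = "Drop"
        return dropped.words
    return []


state = State(
    stack=[Constituent("NP-A", ["the", "report"])],
    input_list=[Constituent("WHNP", ["which"]), Constituent("VP", ["arrived"])],
    previous_op="Reduce",
)
dropped_words = apply_drop(state)
print(dropped_words, [c.label for c in state.input_list], state.previous_op)
```

Here Rule 2 fires, so the words under the leading WHNP are removed from the input list and the previous operation becomes "Drop".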